28 



(1) GENERAL INFORMATION: 




SEQUENCE LIST IN6 



10 



15 




PLICANT: 

(A) NAME: MOGEN INTERNATIONAL NV 
B) STREET: Einsteinweg 97 
) CITY: Leiden 
\) COUNTRY: The Netherlands 
POSTAL CODE (ZIP) : 2233 CB 
TELEPHONE: (31) 71-5258282 
.TELEFAX: (31) 71-5221471 

(ii) TITLE bF INVENTION: Enhanced accumulation of trehalose in plants 

(iii) NUMBER OF SEQUENCES: 27 



iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER : IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Version #1.25 (EPO) 



30 



35 



(2) INFORMATION FOR SEQ\ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 62 :\ base pairs 

(B) TYPE: nucleiaacid 

(C) STRANDEDNESS : \double 

(D) TOPOLOGY: line) 

(ii) MOLECULE TYPE: cDNA t\ mRNA 
(iii) HYPOTHETICAL: NO 



40 



45 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 25.. 2485 
(D) OTHER INFORMATION: /fundtion= "trehalose phosph. 

synthase and trehalos^phosph. phosphatase" 
/product** "bipartite eir^yme" 

(ix) FEATURE : 

(A) NAME/KEY: unsure 

(B) LOCATION: 1609.. 1611 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CTGATCCTGC GGT^TCATCA CAAT ATG ATA CTC TTA CAT CTG ATG CCC CTT 

Met lie Leu Leu His Leu Met Pro Leu 
1 5 



51 



10 



CAG ATG CTC CCA A&T AGG TTG ATT GTC GTA TCG AAT CAG TTA CCC ATA 
Gin Met Leu Pro Asn Arg Leu lie Val Val Ser Asn Gin Leu Pro lie 
10 



ATC GCT AGG CTA AGA 
lie Ala Arg Leu Arg 
30 



STA ACG ACA ATG GAG GGT CCT TTT GGG ATT TCA 
ju Thr Thr Met Glu Gly Pro Phe Gly lie Ser 
35 40 



99 



147 



15 CTT GGG ACG AGA GTT CGAVTTT ACA TGC ACA TCA AAG ATG CAT TAG CCG 
Leu Gly Thr Arg Val Arg Phe Thr Cys Thr Ser Lys Met His Tyr Pro 
45 \ 50 55 



195 




CAG CCG TTG AGG TTT TCT AT* CTT GGC GAT CCA CTA AGG GCT GAC GTT 
20 Gin Pro Leu Arg Phe Ser Ile\l#eu Gly Asp Pro Leu Arg Ala Asp Val 
60 \ 65 70 

CCT ACC GAA CAA GAT GAC G*G TCA AAG ACA TTG CTC GAT AGG TTT 
Gly Pro Thr Glu Gin Asp Asp VaU. Ser Lys Thr Leu Leu Asp Arg Phe 
75 80 \ 85 



AAT TGC GTT GCG GTT TTT GTC CCT 
Asn Cys Val Ala Val Phe Val Pro 
90 95 



E^CT TCA AAA TGG GAC CAA TAT TAT 
ir Ser Lys Trp Asp Gin Tyr Tyr 
100 105 



CAC TGC TTT TGT AAG CAG TAT TTG TGGXCCG ATA TTT CAT TAC AAG GTT 
His Cys Phe Cys Lys Gin Tyr Leu Trp Vro lie Phe His Tyr Lys Val 
110 il5 120 



243 



291 



339 



387 



35 CCC GCT TCT GAC GTC AAG AGT GTC CCG AAT AGT CGG GAT TCA TGG AAC 
Pro Ala Ser Asp Val Lys Ser Val Pro Asn\ser Arg Asp Ser Trp Asn 
125 130 \ 135 



435 



GCT TAT GTT CAC GTG AAC AAA GAG TTT TCC C/fiG 
40 Ala Tyr Val His Val Asn Lys Glu Phe Ser Gin 
140 145 



AAG GTG ATG GAG GCA 
Lys Val Met Glu Ala 
150 



483 



45 



GTA ACC AAT CGT AGC AAT TAT GTA TGG ATA CAT GflC TAC CAT TTA ATG 
Val Thr Asn Arg Ser Asn Tyr Val Trp lie His Asp^Tyr His Leu Met 
155 160 165 



531 



50 



ACG CTA CCG ACT TTC TTG AGG CGG GAT TTT TGT CGT TO?* AAA ATC GGT 579 
Thr Leu Pro Thr Phe Leu Arg Arg Asp Phe Cys Arg PheVLys lie Gly 
170 175 180 \ 185 

TTT TTT CTG CAT AGC CCG TTT CCT TCC TCG GAG GTT TAC AA(V ACC CTA 627 
Phe Phe Leu His Ser Pro Phe Pro Ser Ser Glu Val Tyr Lys \hr Leu 
190 195 2\o 



30 



CCA ATG aga aac gag ctc ttg aag ggt ctg tta aat gct gat ctt atc 
Pro M^t Arg Asn Glu Leu Leu Lys Gly Leu Leu Asn Ala Asp Leu lie 
205 210 215 



675 



GGG TTC CAT ACA TAC GAT TAT GCC CGT CAT TTT CTA ACG TGT TGT AGT 
Gly Phe Wis Thr Tyr Asp Tyr Ala Arg His Phe Leu Thr Cys Cys Ser 
220 225 230 



723 



CGA ATG TTT\GGT TTG GAT CAT CAG TTG AAA AGG GGG TAC ATT TTC TTG 
10 Arg Met Phe ©ly Leu Asp His Gin Leu Lys Arg Gly Tyr lie Phe Leu 
235 \ 240 245 



771 



15 



GAA TAT AAT GG A AGG AGC ATT GAG ATC AAG ATA AAG GCG AGC GGG ATT 
Glu Tyr Asn GlyvArg Ser lie Glu lie Lys lie Lys Ala Ser Gly lie 
250 \ 255 260 265 



819 



20 



CAT GTT GGT CGA ATG GAG TCG TAC TTG AGT CAG CCC GAT ACA AGA TTA 
His Val Gly Arg Mea Glu Ser Tyr Leu Ser Gin Pro Asp Thr Arg Leu 
270\ 275 280 



CAA GTT CAA GAA GTC 
Gin Val Gin Glu Val 
285 



AAA CGT TCG AAG GAA ATC GTG CTA CTG GGA 
Lys Arg Ser Lys Glu lie Val Leu Leu Gly 
290 295 



867 



915 




25 GTT GAT GAT TTG GAT ATA T*C AAA GGT GTG AAC TTC AAG GTT TTA GCG 
Val Asp Asp Leu Asp lie Ph& Lys Gly Val Asn Phe Lys Val Leu Ala 
300 \305 310 



TTG GAG AAG TTA CTT AAA TCA C> 
Leu Glu Lys Leu Leu Lys Ser 
315 320 

AAG GTG CAA ATC TTG AAT CCT CTG 
Lys Val Gin lie Leu Asn Pro Leu 
330 335 



CCG AGT TGG CAA GGG CGT GTG GAA 
Pro Ser Trp Gin Gly Arg Val Glu 
325 

CGT TGC CAA GAC GTC GAT GAG 
Arg Cys Gin Asp Val Asp Glu 
340 345 



963 



1011 



1059 



40 



ATC AAT GCC GAG ATA AGA ACA GTC TGT AGA ATC AAT AAC GAA CTG 1107 

lie Asn Ala Glu lie Arg Thr Val Cys Glu, Arg lie Asn Asn Glu Leu 
350 355\ 360 

GGA AGC CCG GGA TAC CAG CCC GTT GTG TTA AT<T GAT GGG CCC GTT TCG 1155 
Gly Ser Pro Gly Tyr Gin Pro Val Val Leu Ile\Asp Gly Pro Val Ser 
365 370 \ 375 



45 TTA AGT GAA AAA GCT GCT TAT TAT GCT ATC GCC GAT^ATG GCA ATT GTT 
Leu Ser Glu Lys Ala Ala Tyr Tyr Ala lie Ala Asp «et Ala lie Val 
380 385 3i 



1203 



ACA CCG TTA CGT GAC GGA CTG AAT CTT ATC CCG TAC GAG $AC GTC GTT 
50 Thr Pro Leu Arg Asp Gly Leu Asn Leu lie Pro Tyr Glu Ty* Val Val 
395 400 405 



1251 



31 



\TCC CGA CAA AGT GTT AAT GAC CCA AAT CCC AAT ACT CCA AAA AAG AGC 
Arg Gin Ser Val Asn Asp Pro Asn Pro Asn Thr Pro Lys Lys Ser 
LO 415 420 425 



1299 



AT(5v CTA GTG GTC TCC GAG TTC ATC GGT GTT TCA CTA TCT TTA ACC GGG 
MetVeu Val Val Ser Glu Phe He Gly Val Ser Leu Ser Leu Thr Gly 
430 435 440 



1347 



GCC ATA CGG GTC AAC CCA TGG GAT GAG TTG GAG ACA GCA GAA GCA TTA 
10 Ala Ile\Arg Val Asn Pro Trp Asp Glu Leu Glu Thr Ala Glu Ala Leu 
445 450 455 



1395 



15 



TAC GAC GCA CTC ATG GCT CCT GAT GAC CAT AAA GAA ACC GCC CAC ATG 
Tyr Asp AlaNLeu Met Ala Pro Asp Asp His Lys Glu Thr Ala His Met 
460 \ 465 470 



1443 



20 




AAA CAG TAT CAA TAC ATT ATC TCC CAT GAT GTA GCT AAC TGG GCT AGC 
Lys Gin Tyr Gin Vyr He He Ser His Asp Val Ala Asn Trp Ala Ser 
475 \ 480 485 

TTC TTT CAA GAT TTAVGAG CAA GCG TGC ATC GAT CAT TCT CGT AAA CGA 
Phe Phe Gin Asp Leu )Slu Gin Ala Cys He Asp His Ser Arg Lys Arg 
490 4V5 500 505 

TGC ATG AAT TTA GGA TTT\GGG TTA GAT ACT AGA GTC GTC TTT TTG ATG 
Cys Met Asn Leu Gly Phe Sly Leu Asp Thr Arg Val Val Phe Leu Met 
510 \ 515 520 



AGA AGT TTA GCA AGT TGG ATA^ 
Arg Ser Leu Ala Ser Trp He 
525 



\G ATG TCT TGG AAG AAT GCT TAT TCC 
fs Met Ser Trp Lys Asn Ala Tyr Ser 
530 535 



35 



ATG GCT CAA AAT CGG GCC ATA CTT 
Met Ala Gin Asn Arg Ala He Leu 
540 545 



fTG GAC TAT GAC GGC ACT GTT ACT 
ji Asp Tyr Asp Gly Thr Val Thr 
550 



1491 



1539 



1587 



1635 



1683 



40 



CCA TCT ATC AGT AAA TCT CCA ACT GAA 
Pro Ser He Ser Lys Ser Pro Thr Glu 
555 560 



GTT ATC TCC ATG ATC AAC 
Val He Ser Met He Asn 
565 



AAA CTG TGC AAT GAT CCA AAG AAC ATG GTG TT(V ATC GTT AGT GGA CGC 
Lys Leu Cys Asn Asp Pro Lys Asn Met Val Phe \le Val Ser Gly Arg 
570 575 580 \ 585 



1731 



1779 



45 AGT AGA GAG AAA ATC TTG GCA GTT GGT TCG GCG CGT 
Ser Arg Glu Lys He Leu Ala Val Gly Ser Ala Arg 
590 595 



AGA ACC CGC 
Arg Thr Arg 
600 



1827 



CAT TGC ACT GAG CAC GGA TAC TTT ATA AGG TGG GCG GGT GA\ CAA GAA 
50 His Cys Thr Glu His Gly Tyr Phe He Arg Trp Ala Gly AspN^ln Glu 
605 610 615 



1875 



32 



T^G GAA ACG TGC GCA CGT GAG AAT AAT GTC GGG TGG ATG GAT GGA AAT 
Trp^Glu Thr Cys Ala Arg Glu Asn Asn Val Gly Trp Met Asp Gly Asn 
620 625 630 



1923 



CTG AGGXXXS GTT ATG AAT CTT TAT ACA GAA ACT ACT GAC GGT TCG TAT 
Leu Arg Pro Val Met Asn Leu Tyr Thr Glu Thr Thr Asp Gly Ser Tyr 
635 \ 640 645 



1971 



ATT GAA AAG AAAvGAA ACT GCA ATG GTT TGG CAC TAT GAA GAT GCT GAT 
10 lie Glu Lys Lys Glu Thr Ala Met Val Trp His Tyr Glu Asp Ala Asp 
650 \ 655 660 665 



2019 



15 



AAA GAT CTT GGG TTG GAG CAG GCT AAG GAA CTG TTG GAC CAT CTT GAA 
Lys Asp Leu Gly Leu Glu\Gln Ala Lys Glu Leu Leu Asp His Leu Glu 
670 \ 675 680 



2067 



20 




35 



AAC GTG CTC GCT AAT GAG CCC\GTT GGA GTG AAT CGA ACA GGT CAA TAC 2115 
Asn Val Leu Ala Asn Glu Pro Val Gly Val Asn Arg Thr Gly Gin Tyr 
685 \ 690 695 

ATT GTA GAA GTT AAA CCA CAG TCC <5CC ATT AAT TAC CTT CTT GTT ATG 2163 
lie Val Glu Val Lys Pro Gin Ser Ptfo lie Asn Tyr Leu Leu Val Met 
700 705 \ 710 

ACA TTC ATA GGC ACT GAT TGT AGA ATC T^ AAC TTA AAT TTC TTT AAA 2211 
Thr Phe lie Gly Thr Asp Cys Arg lie Phk Asn Leu Asn Phe Phe Lys 
715 720 \ 725 

TAT GAA TGC AAT TAT AGG GGG TCA CTA AAA GOT ATA GTT GCA GAG AAG 2259 
Tyr Glu Cys Asn Tyr Arg Gly Ser Leu Lys Gly\Ile Val Ala Glu Lys 
730 735 740 \ 745 

ATT TTT GCG TTC ATG GCT AAA AAG GGA AAA CAG GCx\GAT TTC GTG TTG 2307 
lie Phe Ala Phe Met Ala Lys Lys Gly Lys Gin Ala \sp Phe Val Leu 
750 755 \ 760 



40 



ACG TTG AAT GAT AGA AGT GAT GAA GAC ATG TTT GTG GCC MT GGG GAT 2355 
Thr Leu Asn Asp Arg Ser Asp Glu Asp Met Phe Val Ala I4e Gly Asp 
765 770 771 

GGA ATA AAA AAG GGT CGG ATA ACT AAC AAC AAT TCA GTG TTT A^ TGC 2403 
Gly lie Lys Lys Gly Arg lie Thr Asn Asn Asn Ser Val Phe ThryCys 
780 785 790 



45 GTA GTG GGA GAG AAA CCG AGT GCA GCT GAG TAC TTT TTA AAT GAT GTC 
Val Val Gly Glu Lys Pro Ser Ala Ala Glu Tyr Phe Leu Asn Asp Val 
795 800 805 



2451 



TCG AGA AGC TCC GGG TGT CTC AGC AAC CAA GGA T GATCCGGAAG 
50 Ser Arg Ser Ser Gly Cys Leu Ser Asn Gin Gly 
810 815 820 



CTTCTCGTGA TCTTTATGAG TTAAAAGTTT TCGACTTTTT CTTCATCAAG ATTCATGGGA 



33 

AAGTTGTTCA ATATGAACTT GTGTTCTTGG TTCTGGATTT TAGGGAGTCT ATGGATATAA 2615 
VCATTTC 2621 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 



(i& MOLECULE TYPE: protein 
(xi)\SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met lie LeOkLeu His Leu Met Pro Leu Gin Met Leu Pro Asn Arg Leu 

1 \ 5 10 15 

20 lie Val Val Se\ Asn Gin Leu Pro lie lie Ala Arg Leu Arg Leu Thr 

20\ 25 30 




Thr Met Glu Gly Pro\Phe Gly lie Ser Leu Gly Thr Arg Val Arg Phe 
35 \ 40 45 



Cys Thr Ser Lys MetNHis Tyr Pro Gin Pro Leu Arg Phe Ser lie 
50 >65 60 



Leu Gly Asp Pro Leu Arg Ala >Asp Val Gly Pro Thr Glu Gin Asp Asp 
65 70 \ 75 80 



Val Ser Lys Thr Leu Leu Asp Arg^Phe Asn Cys Val Ala Val Phe Val 
85 \ 90 95 

Pro Thr Ser Lys Trp Asp Gin Tyr Tyr^His Cys Phe Cys Lys Gin Tyr 
100 105 \ 110 



40 



Leu Trp Pro lie Phe His Tyr Lys Val Pro Via Ser Asp Val Lys Ser 
115 120 \ 125 

Val Pro Asn Ser Arg Asp Ser Trp Asn Ala Tyr yal His Val Asn Lys 

130 135 l) 



45 



Glu Phe Ser Gin Lys Val Met Glu Ala Val Thr Asn Arg Ser Asn Tyr 
145 150 155 \ 160 



Val Trp lie His Asp Tyr His Leu Met Thr Leu Pro Thr Pl\e Leu Arg 

165 170 \l75 

50 Arg Asp Phe Cys Arg Phe Lys lie Gly Phe Phe Leu His Ser Pr^ Phe 

180 185 190 



Pro Ser Ser Glu Val Tyr Lys Thr Leu Pro Met Arg Asn Glu Leu Lei 
195 200 205 



34 



L$s Gly Leu Leu Asn Ala Asp Leu He Gly Phe His Thr Tyr Asp Tyr 
210 215 220 

5 Ala \rg His Phe Leu Thr Cys Cys Ser Arg Met Phe Gly Leu Asp His 
225 \ 230 235 240 

Gin Leu\Lys Arg Gly Tyr He Phe Leu Glu Tyr Asn Gly Arg Ser He 
245 250 255 



10 



Glu He L^s He Lys Ala Ser Gly He His Val Gly Arg Met Glu Ser 
260 265 270 



Tyr Leu Ser Gin Pro Asp Thr Arg Leu Gin Val Gin Glu Val Gin Lys 
15 275 \ 280 285 

Arg Ser Lys GluVtle Val Leu Leu Gly Val Asp Asp Leu Asp He Phe 
290 \ 295 300 

20 Lys Gly Val Asn Phek Lys Val Leu Ala Leu Glu Lys Leu Leu Lys Ser 
305 \l0 315 320 



His Pro Ser Trp Gin GlV Arg Val Glu Lys Val Gin He Leu Asn Pro 
325 \ 330 335 

eu Arg Arg Cys Gin Asp V^i Asp Glu He Asn Ala Glu He Arg Thr 
340 \ 345 350 

Val Cys Glu Arg He Asn Asn GiVi Leu Gly Ser Pro Gly Tyr Gin Pro 
30 355 360\ 365 

Val Val Leu He Asp Gly Pro Val S^t Leu Ser Glu Lys Ala Ala Tyr 
370 375 \ 380 

35 Tyr Ala He Ala Asp Met Ala He Val Th\ Pro Leu Arg Asp Gly Leu 
385 390 ^395 400 



40 



Asn Leu He Pro Tyr Glu Tyr Val Val Ser Ar&Gln Ser Val Asn Asp 
405 410 \ 415 

Pro Asn Pro Asn Thr Pro Lys Lys Ser Met Leu Va\ Val Ser Glu Phe 
420 425 \ 430 



He Gly Val Ser Leu Ser Leu Thr Gly Ala He Arg Va\Asn Pro Trp 
45 435 440 445 

Asp Glu Leu Glu Thr Ala Glu Ala Leu Tyr Asp Ala Leu Met\Ala Pro 
450 455 460 

50 Asp Asp His Lys Glu Thr Ala His Met Lys Gin Tyr Gin Tyr IleVlle 
465 470 475 \80 



Ser His Asp Val Ala Asn Trp Ala Ser Phe Phe Gin Asp Leu Glu 
485 490 495 



Gln\ 



V 



35 



10 



15 



20 




Via Cys lie Asp His Ser Arg Lys Arg Cys Met Asn Leu Gly Phe Gly 
500 505 510 

Leu\Asp Thr Arg Val Val Phe Leu Met Arg Ser Leu Ala S r Trp lie 
515 520 525 

Lys Me\ Ser Trp Lys Asn Ala Tyr Ser Met Ala Gin Asn Arg Ala lie 
530\ 535 540 

Leu Leu A^p Tyr Asp Gly Thr Val Thr Pro Ser He Ser Lys Ser Pro 
545 \ 550 555 560 

Thr Glu Ala \^1 He Ser Met He Asn Lys Leu Cys Asn Asp Pro Lys 
565 570 575 

Asn Met Val Phe Vie Val Ser Gly Arg Ser Arg Glu Lys He Leu Ala 
580 \ 585 590 

Val Gly Ser Ala ArgVval Arg Thr Arg His Cys Thr Glu His Gly Tyr 
595 \ 600 605 

Phe He Arg Trp Ala Gl\ Asp Gin Glu Trp Glu Thr Cys Ala Arg Glu 
€10 \l5 620 



sn Val Gly Trp Met Asp, Gly Asn Leu Arg Pro Val Met Asn Leu 
630 \ 635 640 



yr Thr Glu Thr Thr Asp Gly SerXTyr He Glu Lys Lys Glu Thr Ala 
645 \ 650 655 



40 



45 



Met Val Trp His Tyr Glu Asp Ala AspNLys Asp Leu Gly Leu Glu Gin 
660 665 \ 670 

Ala Lys Glu Leu Leu Asp His Leu Glu AsnVal Leu Ala Asn Glu Pro 
675 680 \ 685 

Val Gly Val Asn Arg Thr Gly Gin Tyr He Val blu Val Lys Pro Gin 
690 695 70t 

Ser Pro He Asn Tyr Leu Leu Val Met Thr Phe He (sStf Thr Asp Cys 
705 710 715 \ 720 

Arg He Phe Asn Leu Asn Phe Phe Lys Tyr Glu Cys Asn T$sr Arg Gly 
725 730 \735 

Ser Leu Lys Gly He Val Ala Glu Lys lie Phe Ala Phe Met AJ^ Lys 
740 745 750 



50 Lys Gly Lys Gin Ala Asp Phe Val Leu Thr Leu Asn Asp Arg Ser 

755 760 765 

Glu Asp Met Phe Val Ala He Gly Asp Gly He Lys Lys Gly Arg He 
770 775 780 



36 



Thr Asn Asn Asn Ser Val Phe Thr Cys Val Val Gly Glu Lys Pro Ser 
785 790 795 800 

5 \Ala Ala Glu Tyr Phe Leu Asn Asp Val Ser Arg Ser Ser Gly Cys Leu 



10 



15 



20 



35 



40 



45 



805 

Se\ Asn Gin Gly 
820 

(2) INFORMATION FOR SEQ ID NO: 3: 

( i ) \eQUENCE CHARACTERISTICS : 
LENGTH: 25 base pairs 
TYPE: nucleic acid 
(CK STRANDEDNESS: single 
(D)\oPOLOGY: linear 



810 



815 




(ii) MOLECULE: TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AAGCTTATGT TGCCATATAG AGTAG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTAGTTGCCA TGGTGCAAAT GTTC 



25 



24 



50 



37 



10 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA (genomic) 
(iii\ HYPOTHETICAL: YES 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGCTCTGCAG TGAGGTACCA 
(2) INFORMATION eW SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :^20 base pairs 

(B) TYPE: nuaS^eic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: lifc^ear 

(ii) MOLECULE TYPE: DNA genomic) 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ IiyiO: 6: 

35 GACGTCACTC CATGGTTCGA 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 



20 



20 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTACCCTGCA GTGTGACCCT AGAC 



24 



38 



10 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<\i) MOLECULE TYPE: DNA (genomic) 
(iii)\HYPOTHETICAL: YES 



20 




15 (xi) SEQUErTCE DESCRIPTION: SEQ ID NO: 8: 

TCGATTCATA GAAGCT^AGA T 

(2) INFORMATION FOr\eQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2207 base pairs 
<B) TYPE: nucle\c acid 

(C) STRANDEDNESS\ double 

(D) TOPOLOGY: lin^ 

(ii) MOLECULE TYPE: cDNA t\ mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuberosX 
35 (B) STRAIN: Kardal 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 161.. 1906 

40 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 842.. 850 

(D) OTHER INFORMATION: /function= "putati\e 
45 glycosylationsite" 



21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
50 CTTTTCTGAG TAATAACATA GGCATTGATT TTTTTTCAAT TAATAACACC 
CCCATTGCCG GCATTCTCTG TTCTTACAAA AAAAAACATT TTTTTGTTCA 



CATAAA" 



CATT 
TAG 



60 
120 



39 



TTATGGCATC AGTATTGAAC CCTTTAACTT GTTATACAAT ATG GGT AAA GCT ATA 

Met Gly Lys Ala He 
1 5 



175 



ATT\TTT ATG ATT TTT ACT ATG TCT ATG AAT ATG ATT AAA GCT GAA ACT 
He \he Met He Phe Thr Met Ser Met Asn Met He Lys Ala Glu Thr 
10 15 20 



223 



TGC AAA^CC ATT GAT AAG GGT CCT GTA ATC CCA ACA ACC CCT TTA GTG 
10 Cys Lys Bfr lie Asp Lys Gly Pro Val He Pro Thr Thr Pro Leu Val 
25 30 35 



271 



15 



ATT TTT CTT 
He Phe Leu 
40 



JAA AAA GTT CAA GAA GCT GCT CTT CAA ACT TAT GGC CAT 
Lys Val Gin Glu Ala Ala Leu Gin Thr Tyr Gly His 
45 50 



319 



AAA GGG TTT GAT <$CT AAA CTG TTT GTT GAT ATG TCA CTG AGA GAG AGT 



20 



Lys Gly Phe Asp Al^i Lys 
55 




CTT TCA GAA ACA GTT G 
Leu Ser Glu Thr Val 



Glu* 
75 



Leu Phe Val Asp Met Ser Leu Arg Glu Ser 
60 65 

GCT TTT AAT AAG CTT CCA AGA GTT GTG AAT 
s^la Phe Asn Lys Leu Pro Arg Val Val Asn 
80 85 



T TCA ATA TCA AAA AGT GAT 
Gly Ser He Ser Lys Ser Asp 
90 



GAT GGT TTT ATA GGT AGT TAC TTG 
Asp Gly Phe He Gly Ser Tyr Leu 
95 100 



AGT AGT CCT GAT AAG GAT TTG GTT TA^L GTT GAG CCT ATG GAT TTT GTG 
Ser Ser Pro Asp Lys Asp Leu Val Tyr Val Glu Pro Met Asp Phe Val 
105 110 \ 115 



35 



GCT GAG CCT GAA GGC TTT TTG CCA AAG GTG 
Ala Glu Pro Glu Gly Phe Leu Pro Lys Val 
120 125 



\G AAT TCT GAG GTG AGG 
Asn Ser Glu Val Arg 
130 



367 



415 



463 



511 



559 



40 



GCA TGG GCA TTG GAG GTG CAT TCA CTT TGG AAG AAT TTA AGT AGG AAA 607 
Ala Trp Ala Leu Glu Val His Ser Leu Trp Lys Asn\Leu Ser Arg Lys 
135 140 145 

GTG GCT GAT CAT GTA TTG GAA AAA CCA GAG TTG TAT ACtNtTG CTT CCA 655 
Val Ala Asp His Val Leu Glu Lys Pro Glu Leu Tyr Thr lVi Leu Pro 
150 155 160 \ 165 



45 TTG AAA AAT CCA GTT ATT ATA CCG GGA TCG CGT TTT AAG GAG GfT TAT 
Leu Lys Asn Pro Val He He Pro Gly Ser Arg Phe Lys Glu ValNfyr 
170 175 180 



703 



TAT TGG GAT TCT TAT TGG GTA ATA AGG GGT TTG TTA GCA AGC AAA ATG 
50 Tyr Trp Asp Ser Tyr Trp Val He Arg Gly Leu Leu Ala Ser Lys Met 
185 190 195 



40 



10 



15 



35 



40 



45 



50 



\TAT GAA ACT GCA AAA GGG ATT GTG ACT AAT CTG GTT TCT CTG ATA GAT 
(r Glu Thr Ala Lys Gly He Val Thr Asn Leu Val Ser Leu He Asp 
200 205 210 

CAA TfT GGT TAT GTT CTT AAC GGT GCA AGA GCA TAC TAC AGT AAC AGA 
Gin Phe>Gly Tyr Val Leu Asn Gly Ala Arg Ala Tyr Tyr Ser Asn Arg 
215 \ 220 225 

AGT CAG CCTiSCT GTC CTG GCC ACG ATG ATT GTT GAC ATA TTC AAT CAG 
Ser Gin Pro P^o Val Leu Ala Thr Met He Val Asp He Phe Asn Gin 
230 \ 235 240 245 

ACA GGT GAT TTA AAT TTG GTT AGA AGA TCC CTT CCT GCT TTG CTC AAG 
Thr Gly Asp Leu AsrkLeu Val Arg Arg Ser Leu Pro Ala Leu Leu Lys 
250 \ 255 260 

GAG AAT CAT TTT TGG AA\ TCA GGA ATA CAT AAG GTG ACT ATT CAA GAT 




Glu Asn His Phe Trp Asn 
265 



Ser Gly He His Lys Val Thr He Gin Asp 
270 275 



AG GGA TCA AAC CAC AGC 
Gin Gly Ser Asn His Ser 
280 



CTG AGT CGG TAC TAT GCT ATG TGG AAT 
Ser Arg Tyr Tyr Ala Met Trp Asn 
290 



GCT GCT GAA AGT GGA TGG GAT TTC AGT TCA AGA TGG ATG AGC AAC GGA 
Ala Ala Glu Ser Gly Trp Asp Phe Ser Ser Arg\rp Met Ser Asn Gly 
330 335 \ 340 



TCT GAT CTG ACA ACA ACT AGT ACA ACA TCA ATT CTA> 
Ser Asp Leu Thr Thr Thr Ser Thr Thr Ser He Leu 
345 350 



GTT GAT TTG 
ro Val Asp Leu 
355 



AAT GCA TTC CTT CTG AAG ATG GAA CTT GAC ATT GCC TTT' 
Asn Ala Phe Leu Leu Lys Met Glu Leu Asp He Ala Phe 
360 365 370 



TA GCA AAT 
u Ala Asn 



CTT GTT GGA GAA AGT AGC ACG GCT TCA CAT TTT ACA GAA GCT 
Leu Val Gly Glu Ser Ser Thr Ala Ser His Phe Thr Glu Ala 
375 380 385 



T CAA 
a Gin 



AAT AGA CAG AAG GCT ATA AAC TGT ATC TTT TGG AAC GCA GAG ATG 
Asn Arg Gin Lys Ala He Asn Cys He Phe Trp Asn Ala Glu Met Gl 
390 395 400 40! 



1087 




CCC CGT CCA GAA TCG TCA ACT £TA GAC AGT GAA ACA GCT TCC GTA 

ro Arg Pro Glu Ser Ser Thr IlW Asp Ser Glu Thr Ala Ser Val 
95 300 \ 305 

CCA AAT ATA TGT GAA AAA AGA GAA TTk TAC CGT GAA CTG GCA TCA 

ro Asn He Cys Glu Lys Arg Glu Leu\Tyr Arg Glu Leu Ala Ser 

315 320 325 



1183 



1279 



41 



10 



15 



20 



35 



40 



45 



50 



^CAA TGG CTT GAT TAC TGG CTT ACC AAC AGC GAC 
fin Trp Leu Asp Tyr Trp Leu Thr Asn Ser Asp 
410 415 

TA* AAA TGG GAA GAT TTG CAC CAG AAC AAG AAG 
Tyi\ Lys Trp Glu Asp Leu His Gin Asn Lys Lys 
425 430 

TTT GTT CCG CTG TGG ACT GAA ATT TCT TGT TCA 
Phe V^l Pro Leu Trp Thr Glu lie Ser Cys Ser 
440 445 



ACT CAG 
Thr Gin 
455 



GTA GTT CAA AGT CTC ATG AGC TCG 
Val Val Gin Ser Leu Met Ser Ser 
460 




GCA GGG AT* GCA ATG ACC TTG TCT AAT ACT GGA 
Ala Gly IleVAla Met Thr Leu Ser Asn Thr Gly 
470 \ 475 480 

CCG AAT GGT TG&CCC CCC CTT CAA CAC ATA ATC 
sn Gly TrpN^ro Pro Leu Gin His lie lie 
0 495 

TCT GGA CTA GAA\GAG GCA AGA ACC TTA GCA 
Ser Gly Leu Glu Glu Ala Arg Thr Leu Ala 
505 \ 510 

TGG TTA AGA ACT AAC ^AT GTG ACT TAC AAG 
Trp Leu Arg Thr Asn Tyr Val Thr Tyr Lys 
520 \525 

GAA AAA TAT GAT GTC ACA A2& TGT GGA GCA 
Tyr Glu Lys Tyr Asp Val Thr LysNfys Gly Ala 
535 540 



ACA TCT GAG GAT ATT 
Thr Ser Glu Asp lie 
420 

TCA TTT GCC TCT AAT 
Ser Phe Ala Ser Asn 
435 

GAT AAT AAT ATC ACA 
Asp Asn Asn lie Thr 
450 

GGC TTG CTT CAG CCT 
Gly Leu Leu Gin Pro 
465 

CAG CAA TGG GAT TTT 
Gin Gin Trp Asp Phe 
485 

ATT GAA GGT CTC TTA 
lie Glu Gly Leu Leu 
500 

AAA GAC ATT GCT ATT 
Lys Asp lie Ala lie 
515 

AAA ACC GGT GCT ATG 
Lys Thr Gly Ala Met 
530 

TAT GGA GGT GGT GGT 
Tyr Gly Gly Gly Gly 
545 



GAA TAT ATG TCC CAA ACG GGT TTC GGA 
Glu Tyr Met Ser Gin Thr Gly Phe Gly 
550 555 

GCA CTT CTA GAG GAA TTT GGA TGG CCT GAA 
Ala Leu Leu Glu Glu Phe Gly Trp Pro Glu Asp^ 
570 575 

TAATGAGCAA GTAGAAAAGC CAAATGAAAC ATCATTGAGT 

AAATAAGCTG CAATGGTTTG CTGATAGTTT ATGTTTTGTA 

TGTACCATAT CAAGTGATAT TACCATGAAC TATGTCGTTC 

TGCAAAAATA ATGCAGTTTT GGAGAATCCG ATAACATAGA 

TAAACAGCTT ACTATATTAA GTAAAAGAAA GATGATTCCT 



AAT GGC GTT GTA CTG 
Asn Gly Val Val Leu 
565 

TTG AAG ATT GAT TGC 
Leu Lys lie Asp Cys 
580 

tttXttttct tcttttgtta 
ttacta^ttc ataaggtttt 
ggactcttoa aatcggattt 

CCATGTATGG AffCTAAATTG 
CTGCTTTAAA 



1423 



1471 



1519 



1567 



1615 



1663 



1711 



1759 




1963 
2023 
2083 
2143 
2203 



42 



AAAA 



2207 



10 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

a) MOLECULE TYPE: protein 

(xi\ SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



15 Met Gly Lyk Ala He He Phe Met He Phe Thr Met Ser Met Asn Met 
1 \ 5 10 15 



He Lys Ala Glti Thr Cys Lys Ser He Asp Lys Gly Pro Val He Pro 
20\ 25 30 



r Pro Leu VaX lie Phe Leu Glu Lys Val Gin Glu Ala Ala Leu 
35 \ 40 45 



In Thr Tyr Gly His Lys x 
50 



X51y Phe Asp Ala Lys Leu Phe Val Asp Met 



Ser Leu Arg Glu Ser Leu Ser 
65 70 



60 



51u Thr Val Glu Ala Phe Asn Lys Leu 



75 



80 



Pro Arg Val Val Asn Gly Ser He 
85 



Ser Lys Ser Asp Leu Asp Gly Phe 



90 



95 



lie Gly Ser Tyr Leu Ser Ser Pro Asp 
100 105 



is Asp Leu Val Tyr Val Glu 



35 



Pro Met Asp Phe Val Ala Glu Pro Glu Gly PI 
115 120 



110 



Leu Pro Lys Val Lys 
125 



40 



Asn Ser Glu Val Arg Ala Trp Ala Leu Glu Val Hi\Ser Leu Trp Lys 
130 135 140 



Asn Leu Ser Arg Lys Val Ala Asp His Val Leu Glu Lys^Pro Glu Leu 
145 150 155 \ 160 

45 Tyr Thr Leu Leu Pro Leu Lys Asn Pro Val He lie Pro Gly 

165 170 11 



50 



Phe Lys Glu Val Tyr Tyr Trp Asp Ser Tyr Trp Val lie Arg Gly 

180 185 190 

Leu Ala Ser Lys Met Tyr Glu Thr Ala Lys Gly He Val Thr Asn Leu 

195 200 205 



43 



v Val Ser Leu lie Asp Gin Phe Gly Tyr Val Leu Asn Gly Ala Arg Ala 
210 215 220 

Tyk Tyr Ser Asn Arg Ser Gin Pro Pro Val Leu Ala Thr Met lie Val 
225 \ 230 235 240 

Asp Il^Phe Asn Gin Thr Gly Asp Leu Asn Leu Val Arg Arg Ser Leu 
245 250 255 

10 Pro Ala Le\Leu Lys Glu Asn His Phe Trp Asn Ser Gly lie His Lys 
>60 265 270 



15 



Val Thr lie GlhyAsp Ala Gin Gly Ser Asn His Ser Leu Ser Arg Tyr 
275 \ 280 285 

Tyr Ala Met Trp AsX Lys Pro Arg Pro Glu Ser Ser Thr lie Asp Ser 
290 \ 295 300 



Glu Thr Ala Ser Val Le\ Pro Asn lie Cys Glu Lys Arg Glu Leu Tyr 
20 305 310\ 315 320 




u Leu Ala Ser Ala AX© Glu Ser Gly Trp Asp Phe Ser Ser Arg 
325 \ 330 335 



rp Met Ser Asn Gly Ser Asp Lfcu Thr Thr Thr Ser Thr Thr Ser lie 
340 \345 350 



Leu Pro Val Asp Leu Asn Ala Phe L^u Leu Lys Met Glu Leu Asp lie 
355 360 \ 365 

Ala Phe Leu Ala Asn Leu Val Gly Glu &er Ser Thr Ala Ser His Phe 
370 375 \ 380 



35 



Thr Glu Ala Ala Gin Asn Arg Gin Lys Ala He Asn Cys He Phe Trp 
385 390 385 400 



Asn Ala Glu Met Gly Gin Trp Leu Asp Tyr TrpYeu Thr Asn Ser Asp 
405 410 \ 415 

40 Thr Ser Glu Asp He Tyr Lys Trp Glu Asp Leu HisNsin Asn Lys Lys 
420 425 \ 430 



45 



Ser Phe Ala Ser Asn Phe Val Pro Leu Trp Thr Glu He ^er Cys Ser 

435 440 445 

Asp Asn Asn He Thr Thr Gin Lys Val Val Gin Ser Leu Met S^r Ser 

450 455 460 



50 



Gly Leu Leu Gin Pro Ala Gly He Ala Met Thr Leu Ser Asn Thr GJt 
465 470 475 



Gin Gin Trp Asp Phe Pro Asn Gly Trp Pro Pro Leu Gin His He He 
485 490 495 



44 



II Glu Gly Leu Leu Arg Ser Gly Leu Glu Glu Ala Arg Thr Leu Ala 
500 505 510 

L.ys Asp He Ala He Arg Trp Leu Arg Thr Asn Tyr Val Thr Tyr Lys 
5 \ 515 520 525 

Lys\Thr Gly Ala Met Tyr Glu Lys Tyr Asp Val Thr Lys Cys Gly Ala 
S30 535 540 

10 Tyr GlV Gly Gly Gly Glu Tyr Met Ser Gin Thr Gly Phe Gly Trp Ser 
545 \ 550 555 560 



15 



Asn Gly v\l Val Leu Ala Leu Leu Glu Glu Phe Gly Trp Pro Glu Asp 
565 570 575 

Leu Lys He Aep Cys 



20 (2) INFORMATION FO\ SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS* single 

(D) TOPOLOGY: lin? 



(ii) MOLECULE TYPE: DNA (gd^omic) 
(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 
35 (B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base^\ i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 
40 (B) LOCATION: 15 

(D) OTHER INFORMATION: /mod base= i 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGYGGNMGMT TYRWNGARKT MTAYKRYTGG GAC 
(2) INFORMATION FOR SEQ ID NO: 12: 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



33 



45 



<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



10 



15 



20 




35 



(ix) FEATURE: 

(A) NAME/KEY: modified. 

(B) LOCATION: 3 
(D) OTHER INFORMATION: 

(i\) FEATURE: 

(A) NAME/KEY: modified_ 
<B) LOCATION: 6 
)) OTHER INFORMATION: 

(ix) FEAT 1 

(A) JftaiE/KEY: modified^ 

(B) LOCATION: 9 

(D) OTHE^ INFORMATION: 

x) FEATURE: 

(A) NAME/KEY\ modified^ 

(B) LOCATION: \2 
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME /KEY: modified 

(B) LOCATION: 15 
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: modified^ 

(B) LOCATION: 21 

(D) OTHER INFORMATION: 



base 

/mod_base= i 
base 

/mod_base= i 
base 

/mod_base= i 
base 

/mod__base= i 

base 
mod_base= i 

base> 
/mod ba^e= i 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12 ?S 
40 GTNCCNGGNG GNCGNTTYRW NGARKT 

(2) INFORMATION FOR SEQ ID NO: 13: 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



26 



50 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) 



HYPOTHETICAL: YES 



46 



10 



15 




35 



40 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(aKname/KEY: modifiedjbase 
<B) \oCATION: 12 

(D) OT^ER INFORMATION: /mod_Jbase= i 

(ix) FEATURE: 

(A) NAME/&EY: modif ied_base 

(B) LOCATION 15 

(D) OTHER INFORMATION: /modJbase= i 



) FEATURE : 

(A) NAME/KEY: modl£ ied_J>ase 

(B) LOCATION: 18 
(D) OTHER INFORMATION: /mod base= i 



(xi) SEQUENCE DESCRIPTION: SEQ\D NO: 13: 
GGNGGYTGNS WNCGNYRNAG RTARTA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



26 



(ix) FEATURE: 
45 (A) NAME /KEY: modifiedjbase 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod base= 



(ix) FEATURE: 
50 (A) NAME/KEY: modifiedjbase 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /mod base= 



10 



15 



20 




47 



(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied__base 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /mod_base= i 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
NSCRTTNRYC CA^CRAANC CNTC 
(2) INFORMATION E^R SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :\o base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES6: single 

(D) TOPOLOGY: linear 

i) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



15: 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NC 
CGAAACGGGC CCATCAATTA 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



24 



20 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
50 TCGATGAGAT CAATGCCGAG 



48 



\ (2) INFORMATION FOR SEQ ID NO: 17: 

\ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: CDNA 



(iii) HYPOTHETICAL: NO 



15 (xi) SEQUENCE^OESCRIPTION: SEQ ID NO: 17; 

CCATCCTAAT ACGACTCACT\ATAGGGC 
(2) INFORMATION FOR SEQ NO: 18: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 baseVairs 

(B) TYPE: nucleic aciii 

(C) STRANDEDNESS: singf* 

(D) TOPOLOGY: linear 



27 




(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

35 CACAACAGGC TGGTATCCCG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



20 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAATAACGAA CTGGGAAGCC 



20 



49 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



15 (xi) ^SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ACTCACTATA\GGGCTCGAGC GGC 
(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtoEDNESS : single 

(D) TOPOLOGY: linear 



23 




(ii) MOLECULE TYPB^: cDNA 
iii) HYPOTHETICAL: 



(ix) FEATURE: 

(A) NAME /KEY: modii^ed base 

(B) LOCATION: 4 
35 (D) OTHER INFORMATION /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modified^ 

(B) LOCATION: 6 
40 (D) OTHER INFORMATION: /modJ^ase= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 9 
45 (D) OTHER INFORMATION: /mod base= i x 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 15 

50 (D) OTHER INFORMATION: /mod_base= i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GAYNTNATNT GGRTNCAYGA YTAYCA 



26 



50 , 



(2) INFORMATION FOR SEQ ID NO: 22: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) ^OLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



15 (ix) FEATURE] 

(A) m 

(B) LOCAT3 
(D) OTHER 



35 



40 



45 



50 



modi f iecMba se 
3 

PORMATION: /mod base= i 




(ix) FEATURE: 

(A) NAME/KEY: mbU^f iedjsase 

(B) LOCATION: 6 

(D) OTHER INFORMATl6^: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_b^e 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_^ase= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 18 
(D) OTHER INFORMATION: /mod base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCNACNGTRC ANGCRAANAC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



20 



51 



10 



15 




30 



\(ix) FEATURE: 

(A) NAME /KEY : modified^ 

(B) LOCATION: 2 
(D) OTHER INFORMATION: 

(ix\ FEATURE: 

(A) NAME/KEY: modified, 

(B) LOCATION: 5 
(D) OTHER INFORMATION; 

(ix) FEATURE 

i) NAME/KEY: modified^ 
(fe) LOCATION: 8 
(D\ OTHER INFORMATION; 

(ix) FEAT 

(A) lifcME/KEY: modified^ 

(B) LOCATION: 14 
(D) OTHpR INFORMATION: 

(ix) FEATURE 

(A) NAME/kEY: modified 

(B) LOCATION: 20 
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY :\modified 

(B) LOCATION: 123 
(D) OTHER INFORMATION; 



base 

/mod_base= i 
base 

/mod_base= i 
base 

/mod_Jbase= i 
base 

/mod_base= i 
base 

/mod_base= i 
base 

/mod base= i 



(xi) SEQUENCE DESCRIPTION^ SEQ ID NO: 23: 

35 TNGGNTKNTT YYTNCAYAYN CCNTTYCC 

(2) INFORMATION FOR SEQ ID NO: 2' 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



28 



50 (ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod base= i 



52 



10 



15 




(ix) FEATURE: 

(A) NAME/KEY: modif iedjoase 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

^x) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= i 

(xi) SEQUELS DESCRIPTION: SEQ ID NO: 24: 
TGRTCNARNA RYTCYTT^C 
(2) INFORMATION FOR SfeQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 r}^se pairs 

(B) TYPE: nucl 

(C) STRANDEDNESS: ^ngle 

(D) TOPOLOGY: linear 

li) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 
35 (A) NAME/KEY: modif ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /modJbase= i 

(ix) FEATURE: 
40 (A) NAME/KEY: modif ied_base 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 
45 (A) NAME/KEY: modif ied_base 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= i 



20 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CCRTGYTCNG CNSWNARNCC 



20 



53 

(2) INFORMATION FOR SEQ ID NO: 26: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



f ix) FEATURE : 
15 \ (A) NAME/KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 
20 (^4 NAME/KEY: modif ied_base 

(B)\LOCATION: 17 

(D) OTHER INFORMATION: /mod base= 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TCRTCNGTRA ARTCRTCNCC 
(2) INFORMATION FOR SEQ l\NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base 

(B) TYPE: nucleic acic 

(C) STRANDEDNESS: singl^ 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



20 



(ix) FEATURE: 

(A) NAME /KEY: modif iedjbase 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 15 



